ABSTRACT


Application  of  the  Pearson  chi-square  test  to  goodness  of  fit  of  a 
distribution  often  leads  to  serious  difficulties,  particularly  in  the 
formation  of  intervals  (as  in  the  case  of  a  continuous  distribution)  and  in 
the  estimation  of  unknown  parameters.  Under  suitable  conditions  and  with 
appropriately  constructed  estimators  of  the  parameters,  the  test  statistic 
converges  in  distribution  to  that  of  chi-square  as  the  sample  size 
increases.  In  the  present  paper,  a  comparatively  simple  least-squares 
approximation  to  the  minimum  chi-square  estimator  is  developed  which,  when 
appropriately  implemented,  results  in  an  asymptotic  chi-square  distribution  of 
the  test  statistic.  This  estimator  is  developed  for  the  cases  of  fixed  and 
random  intervals,  and  the  role  of  the  underlying  assumptions  is  studied  in 
detail. 


AMS  (MOS)  Subject  Classifications:  62E20,  62F12,  62F03 

Key Words: Pearson chi-square test, test of fit, asymptotic distribution,
least-squares approximation
SIGNIFICANCE AND EXPLANATION


The  research  considered  here  falls  within  the  area  of  statistics  known  as 
tests  of  goodness  of  fit.  Since  many  widely  used  statistical  procedures  are 
based  on  particular  distributional  assumptions,  it  is  quite  possible  in 
practice  that  the  resulting  analysis  can  lead  to  erroneous  or  misleading 
conclusions.  In  virtually  every  area  where  statistics  is  applied,  e.g. 
reliability  of  a  product,  prediction  of  performance,  evaluation  of  a 
treatment,  etc.,  some  sorts  of  assumptions  are  usually  made,  either  explicitly 
or  implicitly. 

One  of  the  most  widely  used  tests  of  goodness  of  fit  is  that  known  as  the 
Pearson  chi-square  test.  Many  articles  on  this  subject  have  appeared  in  the 
statistical  literature,  dealing  with  such  problems  as  the  construction  and 
number  of  intervals  to  be  used  in  the  test,  and  the  effect  of  estimators  of 
unknown parameters on its asymptotic distribution. In the present paper, a
comparatively simple estimator, based on least squares, is developed which,
under precisely formulated conditions, results in an asymptotic chi-square
distribution of the test statistic. The usefulness of this result lies in
improving the application of the Pearson chi-square test: the simpler
asymptotic chi-square distribution obviates the difficulty of employing a test
statistic whose asymptotic distribution depends on unknown parameters.


LEAST-SQUARES APPROXIMATION TO MINIMUM CHI-SQUARE ESTIMATORS OF LOCATION AND
SCALE PARAMETERS AND THEIR EFFECT ON THE PEARSON CHI-SQUARE TEST
A.  E.  Muhly  and  John  Gurland 


1.  Introduction 

Suppose that a random sample of size N from a population with unknown distribution
function $F(\cdot)$ is given and that it is desired to test the composite null hypothesis

(1.1)  $H_0 : F(\cdot) = F_0(\cdot\,;\theta)$.

In (1.1) it is assumed that the functional form of $F_0$ is completely specified while the
parameter vector $\theta$, which is an element of an open set $\Theta$ contained in
s-dimensional Euclidean space, is unknown and unspecified. In addition, it is assumed that
for every $\theta$ in $\Theta$, $F_0(x;\theta)$ is continuous in x.

To apply the Pearson chi-square test to the above problem it is necessary to: i)
partition the support of $F_0$ into k intervals, and ii) estimate the unknown value of
$\theta$. When the endpoints of the intervals are specified prior to observing the sample
(the fixed interval case) it is well known (see Fisher (1924), Cramer (1946), Rao (1958),
and Birch (1964)) that if the minimum chi-square (or asymptotically equivalent) estimator
of $\theta$ is employed then the limit distribution of the resulting test statistic is the
chi-square distribution with k-s-1 degrees of freedom. Furthermore, Chibisov (1971),
pursuing a conjecture of Watson's (1958, 1959), showed that this result remains true when
the interval endpoints are allowed to depend on the sample (the random interval case),
provided that these endpoints are treated as fixed for the purpose of obtaining this
estimator.

The practical significance of the above results is diminished, however, since it is
rarely possible to obtain closed form expressions for the minimum chi-square estimator.
This fact led to a discussion of the effect that other methods of estimation have on the
limit distribution of the Pearson chi-square test statistic. In the fixed interval case
the definitive result along these lines is due to Chernoff and Lehmann (1954) who showed
that if the method of maximum likelihood based on the original (i.e. ungrouped) sample is
employed to estimate $\theta$ then the desired limit distribution is the same as the
distribution of

(1.2)  $\chi^2_{k-s-1} + \sum_{i=1}^{s} \lambda_i Z_i^2$,

where $\chi^2_{k-s-1}$ is a random variable distributed as chi-square with k-s-1 degrees
of freedom, $Z_1,\ldots,Z_s$ are independent standard normal random variables distributed
independently of $\chi^2_{k-s-1}$, and $\lambda_1,\ldots,\lambda_s$, which are the
characteristic roots of a certain matrix, satisfy $0 < \lambda_i < 1$. Roy (1956) and,
working independently, Watson (1957, 1958) showed that this result also holds true in the
random interval case with one important distinction. While it is generally the case that
the $\lambda_i$ appearing in (1.2) depend on the unknown value of $\theta$, if $F_0$
depends only on unknown location and scale parameters and if random interval boundaries,
formed as in (3.17) below, are employed then the $\lambda_i$ appearing in (1.2) do not
depend on $\theta$.
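As a rough illustration of why (1.2) is awkward in practice, the following sketch approximates an upper percentile of $\chi^2_{k-s-1} + \sum \lambda_i Z_i^2$ by simulation. The function names, the number of draws, and the example values of the $\lambda_i$ are assumptions made only for this illustration; they do not come from the paper.

```python
import random


def sample_limit_statistic(k, s, lambdas, rng):
    """Draw one value of chi2_{k-s-1} + sum_i lambda_i * Z_i**2, as in (1.2)."""
    if len(lambdas) != s:
        raise ValueError("need exactly s characteristic roots")
    df = k - s - 1
    # A chi-square variable with df degrees of freedom is Gamma(shape=df/2, scale=2).
    chi2_part = rng.gammavariate(df / 2.0, 2.0)
    normal_part = sum(lam * rng.gauss(0.0, 1.0) ** 2 for lam in lambdas)
    return chi2_part + normal_part


def upper_quantile(k, s, lambdas, level=0.95, draws=20000, seed=0):
    """Monte Carlo estimate of the `level` quantile of the distribution (1.2)."""
    rng = random.Random(seed)
    values = sorted(sample_limit_statistic(k, s, lambdas, rng) for _ in range(draws))
    return values[min(draws - 1, int(level * draws))]
```

For instance, `upper_quantile(7, 2, [0.4, 0.7])` falls between the corresponding percentiles of the chi-square distributions with k-s-1 = 4 and k-1 = 6 degrees of freedom, so the critical value cannot be fixed without knowing the $\lambda_i$.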

Other estimators, satisfying fairly general regularity conditions, have also been
considered in the random interval case (see, in particular, Roy (op. cit.) and Moore and
Spruill (1975)). The resulting limit distribution of the Pearson chi-square test
statistic when these estimators are used is similar in form to the distribution of (1.2)
except that now the $\lambda_i$ appearing there need not be bounded above by one.
Furthermore, it is generally the case that the $\lambda_i$ depend on $\theta$ although, as
shown by Dahiya and Gurland (1972), this is not the case when only location and scale
parameters are involved and they are estimated by the ungrouped sample mean and standard
deviation.

Published attempts to contend with the problems raised in the preceding paragraphs
include the following. If the number, k, of intervals employed is large then it is
possible to use either i) Cramer's (op. cit.) approximation to the minimum chi-square
estimator for location and scale parameters, or ii) Watson's (1958) argument for ignoring
the term $\sum_{i=1}^{s} \lambda_i Z_i^2$ appearing in (1.2). If, on the other hand, k is
moderate or small then, as suggested by Rao and Robson (1974) and generalized by Moore
(1977), it is possible to alter the quadratic form which defines the Pearson chi-square
statistic to obtain a limiting chi-square distribution. Alternatively, if it is known that
(1.2) does not depend on $\theta$, then percentiles of its distribution can be computed by
means of Laguerre series.

The solutions mentioned above are not entirely adequate. First, it is demonstrated
in Dahiya and Gurland (1973) that the power of the Pearson chi-square test may be
drastically reduced by using a large number of intervals. This observation is further
supported by Spruill (1976a) who uses a different method for comparing power. Therefore,
it is not desirable to assume, a priori, that k is large. Second, in some cases of
interest the elements of the matrix of the modified quadratic form used by Rao and Robson
(op. cit.) and the $\lambda_i$ appearing in (1.2) pose significant computational problems.
Finally, it is possible that the use of the minimum chi-square estimator leads to a more
powerful test than any of the solutions mentioned above (see Chibisov (op. cit.), Moore
and Spruill (op. cit.), and Spruill (1976b)).

In this paper it is shown how, in some situations, it is possible to obtain a least
squares approximation, $\hat\theta_N$, to the minimum chi-square estimator
$\tilde\theta_N$ as defined in (3.5) and (3.22) below. Initially, it will be assumed that
$F_0$ depends only on unknown location and scale parameters. It will be seen that
$\hat\theta_N$ is easy to compute and, when random intervals are employed, has a
particularly nice interpretation. Furthermore, it will be shown that $\hat\theta_N$ and
$\tilde\theta_N$ are asymptotically equivalent in the sense that
$\hat\theta_N - \tilde\theta_N = o_p(1/\sqrt{N})$, and, therefore, that the limit
distribution of the Pearson chi-square statistic is the chi-square distribution with k-3
degrees of freedom when $\hat\theta_N$ is used.

The next section introduces some conventions that will be used throughout this paper
and presents the assumptions that are required. Section 3 is devoted to a description of
the notation that will be used. In addition, known results are recorded which are
required in later sections. In section 4, $\hat\theta_N$ is presented for the random
interval case and, in section 5, the necessary modifications required to define
$\hat\theta_N$ in the fixed interval case are given. In section 6 the asymptotic
properties of $\hat\theta_N$ and the resulting test statistics are derived. The
assumptions given in section 2 are extended, in section 7, to treat some distributions
which involve unknown parameters not of the location and scale variety. Finally, an
appendix is included where certain technical points are derived.

2.  Conventions  and  Assumptions 

In this paper the following conventions will be employed. First, all vectors
appearing here are column vectors and the notation A' will be used to denote the
transpose of the vector (or matrix) A. In addition, if A is a vector then
$\|A\| = (A'A)^{1/2}$. Second, if $a_0, a_1, \ldots, a_{k-1}, a_k$ are given numbers, the
notation $\Delta a_i = a_i - a_{i-1}$ for $i = 1,\ldots,k$ will be used. If, however,
$a_0$ and $a_k$ are not defined or do not enter into the discussion, they will be treated
as zero and the notation $\Delta a_i = a_i - a_{i-1}$ for $i = 1,\ldots,k$ will be used.
Thus, $\Delta a_1 = a_1$, $\Delta a_k = -a_{k-1}$ and $\Delta a_i = a_i - a_{i-1}$ for
$i = 2,\ldots,k-1$. Third, if $X_N$ is a sequence of random arrays such that the elements
of $f(N)X_N$ converge in probability to 0 as N tends to infinity, the notation
$X_N = o_p(1/f(N))$ will be employed. Similarly, if $X_N$ is a sequence of random arrays
and if each element of $f(N)X_N$ remains bounded in probability as N tends to infinity,
then the notation $X_N = O_p(1/f(N))$ will be used. Finally, if the p-dimensional random
vector X has the multivariate normal distribution with mean vector $\mu$ and covariance
matrix $\Sigma$, this will be denoted by $X \sim N_p(\mu,\Sigma)$. In addition, if $X_N$
is a sequence of random vectors converging in distribution to X (where
$X \sim N_p(\mu,\Sigma)$) as N tends to infinity, then this will be denoted by
$X_N \xrightarrow{d} N_p(\mu,\Sigma)$. Similarly, if the random variable X has a
chi-square distribution with r degrees of freedom the notation $X \sim \chi^2_r$ will be
employed, while $X_N \xrightarrow{d} \chi^2_r$ means that the sequence of random
variables, $X_N$, converges in distribution to a random variable X (with
$X \sim \chi^2_r$) as N tends to infinity.
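The second convention is easy to misread, so a small sketch may help. The helper name `delta` and its keyword arguments are hypothetical and serve only to illustrate the difference notation with undefined end values treated as zero.

```python
def delta(values, first=0.0, last=0.0):
    """Differences a_i - a_{i-1}, i = 1..k, for interior values a_1..a_{k-1}.

    The end values a_0 and a_k default to zero, which is the convention the
    text adopts when they are undefined or irrelevant.
    """
    padded = [first] + list(values) + [last]
    return [padded[i] - padded[i - 1] for i in range(1, len(padded))]
```

With the defaults the differences always sum to zero; passing `last=1.0` gives the cell probabilities associated with a vector of cumulative probabilities.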

The results obtained in sections 3 through 6 are based on the following assumptions.
Assumption 1: There exists a parameterization of $F_0(x;\theta)$ such that
i)  $\Theta = \{\theta : \theta_1 \in R \text{ and } \theta_2 > 0\}$
ii) $F_0(x;\theta) = F_0(\theta_1 + \theta_2 x)$ where $F_0(\cdot) = F_0(\cdot\,;\theta)$
    for $\theta = (0,1)'$.

Assumption 2: $F_0(x)$ is continuously differentiable in x and $f_0(x) = dF_0(x)/dx$ is
positive for all finite x.

In addition to these assumptions on $F_0$, it will be necessary to show that various
estimators of $\theta$ satisfy the following condition.

Definition: An estimator $\theta^{**}_N$ of $\theta$ will be said to satisfy condition A
if there exists a function $h : R \to R^2$ such that

i)  $\int_R h(x)\,dF_0(x;\theta^0) = 0$ and $\int_R h(x)h(x)'\,dF_0(x;\theta^0) = G$,
    where the elements of G are finite and G is positive semi-definite.

ii) $\theta^{**}_N - \theta^0 - \frac{1}{N}\sum_{a=1}^{N} h(x_a) = o_p(1/\sqrt{N})$.

Here, $\theta^0$ denotes the true unknown value of $\theta$ and $x_1,\ldots,x_N$ denotes
the original sample.

The following comments are in order. i) Since the problem of primary interest is the
test of $H_0$ and not the estimation of $\theta$, how $F_0$ is parameterized is not a
significant consideration. Thus, for example, the Cauchy distribution, with the density
$(\lambda/\pi)[\lambda^2 + (x-\eta)^2]^{-1}$, satisfies assumptions 1 and 2 with
$F_0(x) = 1/2 + (1/\pi)\tan^{-1}(x)$, $\theta_1 = -\eta/\lambda$, and
$\theta_2 = 1/\lambda$. ii) In both the fixed and random interval cases, in order to
compute $\hat\theta_N$, a preliminary estimate, $\theta^*_N$, of $\theta^0$ is required
which satisfies $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$. If
$\int_{-\infty}^{\infty} x^4\,dF_0(x;\theta^0)$ is finite, then, under the
parameterization imposed by assumption 1, a natural choice for $\theta^*_N$ is given by

(2.1)  [the expressions for $\theta^*_{1,N}$ and $\theta^*_{2,N}$, which involve the
       sample mean $\bar X_N = \frac{1}{N}\sum_{a=1}^{N} x_a$ and the sum of squared
       deviations $\sum_{a=1}^{N}(x_a - \bar X_N)^2$, are illegible in the source].

However, in general it is not required that
$\int_{-\infty}^{\infty} x^4\,dF_0(x;\theta^0)$ exists, nor is it required that
$\theta^*_N$ be given by (2.1). iii) An immediate consequence of assumption 2 is that
$F_0^{-1}(t)$ is continuously differentiable for $0 < t < 1$ and that $dF_0^{-1}(t)/dt$
is given by $1/f_0(F_0^{-1}(t))$. Thus, it follows from Taylor's theorem with Cauchy
remainder that for $0 < t < 1$ and $0 < \delta < 1$,

(2.2)  $F_0^{-1}(t) = F_0^{-1}(\delta) + (t-\delta)/f_0(F_0^{-1}(\delta)) + r(t,\delta)$,

where

(2.3)  $\sup_{|t-\delta| < \tau} |r(t,\delta)| = o(\tau)$ as $\tau \to 0$.

iv) It will be shown in the appendix that assumptions 1 and 2 are sufficient to ensure
that the assumptions given in section 2 of Chibisov (op. cit.) are satisfied.
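The reparameterization in comment i) can be checked numerically. The sketch below compares the Cauchy distribution function with location $\eta$ and scale $\lambda$ against $F_0(\theta_1 + \theta_2 x)$; the function names are hypothetical and the parameter values in the usage note are arbitrary.

```python
import math


def standard_cauchy_cdf(y):
    """F_0(y) = 1/2 + arctan(y)/pi, the distribution function for theta = (0, 1)'."""
    return 0.5 + math.atan(y) / math.pi


def cauchy_cdf(x, eta, lam):
    """Distribution function of the density (lam/pi) / (lam**2 + (x - eta)**2)."""
    return 0.5 + math.atan((x - eta) / lam) / math.pi


def location_scale_theta(eta, lam):
    """Parameters (theta_1, theta_2) of assumption 1 for the Cauchy example."""
    return (-eta / lam, 1.0 / lam)
```

For any x, `cauchy_cdf(x, eta, lam)` and `standard_cauchy_cdf(theta1 + theta2 * x)` agree, with `(theta1, theta2) = location_scale_theta(eta, lam)`.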


3.  The Quadratic Form Minimized by $\tilde\theta_N$ and Some Preliminary Results

The Pearson chi-square test statistic measures deviations from $H_0$ by considering
the difference between the empirical distribution function, $F_N(x)$, and the (possibly
estimated) hypothesized distribution function, $F_0(x;\theta)$, evaluated at k-1 points
interior to the support of $F_0$. Here, the empirical distribution function is defined as

(3.1)  $F_N(x) = \frac{1}{N}\sum_{a=1}^{N} I_{(-\infty,x]}(x_a)$,

where $I_A(x)$ denotes the indicator function of the set A.
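A minimal sketch of the empirical distribution function in (3.1) follows. The closure-based design and the name `empirical_cdf` are choices made for this illustration only.

```python
import bisect


def empirical_cdf(sample):
    """Return F_N, where F_N(x) is the fraction of observations that are <= x."""
    ordered = sorted(sample)
    n = len(ordered)

    def F_N(x):
        # bisect_right counts the observations x_a with x_a <= x.
        return bisect.bisect_right(ordered, x) / n

    return F_N
```

Sorting once and using binary search keeps each evaluation at logarithmic cost, which is convenient when $F_N$ is evaluated at every interval boundary.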

For the fixed interval case, these points are specified prior to observing the sample
and will be denoted by

(3.2)  $-\infty < c_1 < c_2 < \cdots < c_{k-1} < +\infty$,

where it will be assumed that $k > 4$. Note that if $c_0$ and $c_k$ are defined to be
$-\infty$ and $+\infty$ respectively, then assumption 2 guarantees that
$\Delta F_0(c_i;\theta) > 0$, $i = 1,\ldots,k$, for all $\theta$ in $\Theta$.

Define the (k-1)-dimensional random vector $u(\theta)$ by

(3.3)  $u_i(\theta) = F_N(c_i) - F_0(c_i;\theta)$,  $i = 1,\ldots,k-1$,

and define the (k-1) x (k-1) symmetric matrix $D(\theta)$ by

(3.4)  $D(\theta) = \left(F_0(c_{\min(i,j)};\theta)\,[1 - F_0(c_{\max(i,j)};\theta)]\right)_{i,j=1}^{k-1}$.

Then the minimum chi-square estimate, $\tilde\theta_N$, is defined to be any estimate for
which there exists a sequence of positive constants $\rho_N$, with $\rho_N \to 0$ as
$N \to \infty$, such that

(3.5)  $X^2_N(\tilde\theta_N) < \inf_{\theta \in \Theta} X^2_N(\theta) + \rho_N$,

where

(3.6)  $X^2_N(\theta) = N \sum_{i=1}^{k} (\Delta F_N(c_i) - \Delta F_0(c_i;\theta))^2 / \Delta F_0(c_i;\theta)$
       $= N\,u(\theta)' D(\theta)^{-1} u(\theta)$.
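The two expressions for $X^2_N(\theta)$ in (3.6) can be compared numerically. The sketch below takes the values $F_N(c_i)$ and $F_0(c_i;\theta)$ at the k-1 interior boundaries as plain lists; the function names and the small Gaussian-elimination helper are illustrative assumptions, not part of the paper.

```python
def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    a = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for j in range(col, n + 1):
                a[r][j] -= factor * a[col][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(a[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (a[i][n] - tail) / a[i][i]
    return x


def pearson_cell_form(n_obs, FN, F0):
    """N * sum over the k cells of (observed - expected)**2 / expected."""
    FN, F0 = list(FN), list(F0)
    observed = [hi - lo for lo, hi in zip([0.0] + FN, FN + [1.0])]
    expected = [hi - lo for lo, hi in zip([0.0] + F0, F0 + [1.0])]
    return n_obs * sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def pearson_quadratic_form(n_obs, FN, F0):
    """N * u' D^{-1} u with u as in (3.3) and D as in (3.4)."""
    u = [a - b for a, b in zip(FN, F0)]
    m = len(u)
    D = [[F0[min(i, j)] * (1.0 - F0[max(i, j)]) for j in range(m)] for i in range(m)]
    z = solve(D, u)  # z = D^{-1} u
    return n_obs * sum(ui * zi for ui, zi in zip(u, z))
```

For example, with N = 100, `FN = [0.25, 0.45, 0.85]` and `F0 = [0.2, 0.5, 0.8]` both functions return the same value up to rounding error.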

Let $B = (b_{i,j})$ denote the k x 2 matrix with $b_{i,j}$ given by

(3.7)  $b_{i,j} = (\Delta F_0(c_i;\theta^0))^{-1/2}\,\frac{\partial}{\partial\theta_j}\Delta F_0(c_i;\theta)\Big|_{\theta=\theta^0}$,  $i = 1,\ldots,k$,  $j = 1,2$,

and let $Y(\theta)$ denote the k-dimensional random vector with

(3.8)  $Y_i(\theta) = \Delta u_i(\theta)/[\Delta F_0(c_i;\theta)]^{1/2}$,  $i = 1,\ldots,k$.

If $\tilde\theta_N$ satisfies (3.5) and if assumptions 1 and 2 hold then, by a theorem of
Chibisov (op. cit.),

(3.9)  $\tilde\theta_N - \theta^0 = (B'B)^{-1}B'Y(\theta^0) + o_p(1/\sqrt{N})$.


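Notice that

(3.10)  $\frac{\partial}{\partial\theta_1} F_0(x;\theta) = f_0(\theta_1+\theta_2 x)$,
        $\frac{\partial}{\partial\theta_2} F_0(x;\theta) = x f_0(\theta_1+\theta_2 x)$.

Thus, if the (k-1) x (k-1) symmetric matrices $\Pi(\theta)$ and $\Gamma(\theta)$ are
defined by

(3.11)  $\Pi(\theta) = \mathrm{diag}[f_0(\theta_1+\theta_2 c_1),\ldots,f_0(\theta_1+\theta_2 c_{k-1})]$,
        $\Gamma(\theta) = \Pi(\theta)^{-1} D(\theta)\,\Pi(\theta)^{-1}$,

then (3.9) can be written as

(3.12)  $\tilde\theta_N - \theta^0 = [(1,c)'\Gamma(\theta^0)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta^0)^{-1}\Pi(\theta^0)^{-1}u(\theta^0) + o_p(1/\sqrt{N})$,

where 1 is the (k-1)-dimensional vector of ones and $c = (c_1,\ldots,c_{k-1})'$.
Therefore, it follows from (3.12) that $\tilde\theta_N$ satisfies condition A with

(3.13)  $h(x) = [(1,c)'\Gamma(\theta^0)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta^0)^{-1}\Pi(\theta^0)^{-1}\,(I_{(-\infty,c]}(x) - F_0(c;\theta^0))$

where, for any vector $a = (a_1,\ldots,a_{k-1})'$,

(3.14)  $I_{(-\infty,a]}(x) = (I_{(-\infty,a_1]}(x),\ldots,I_{(-\infty,a_{k-1}]}(x))'$

and

(3.15)  $F_0(a;\theta^0) = (F_0(a_1;\theta^0),\ldots,F_0(a_{k-1};\theta^0))'$.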

For the random interval case, as considered here, the points where the difference
between $F_N$ and $F_0$ is observed are obtained as follows. First, prior to observing
the sample, partition the interval [0,1] into k (> 4) intervals and let

(3.16)  $0 = \delta^0_0 < \delta^0_1 < \cdots < \delta^0_{k-1} < \delta^0_k = 1$

denote the end points of these intervals. Next, define the functions

(3.17)  $g_i(\theta) = F_0^{-1}(\delta^0_i;\theta)$,  $i = 1,\ldots,k-1$.

Thus, for the parameterization specified by assumption 1,
$g_i(\theta) = (c_i - \theta_1)/\theta_2$ where $c_i = F_0^{-1}(\delta^0_i)$,
$i = 1,\ldots,k-1$. Then, after observing the sample, obtain a preliminary estimate,
$\theta^*_N$, of $\theta$ and compute $g_i(\theta^*_N)$ for $i = 1,\ldots,k-1$. The
difference between $F_N$ and $F_0$ is then observed at the k-1 random points
$g_1(\theta^*_N),\ldots,g_{k-1}(\theta^*_N)$.
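The construction of the random boundaries can be sketched as follows, taking the standard normal distribution as $F_0$. That choice, the moment-based preliminary estimate, and all function names are assumptions of this illustration; for the normal family, $\theta_1 = -\mu/\sigma$ and $\theta_2 = 1/\sigma$, so the boundaries reduce to the sample mean plus the sample standard deviation times $c_i$.

```python
from statistics import NormalDist, fmean, pstdev

STD_NORMAL = NormalDist()  # plays the role of F_0 in this sketch


def preliminary_theta(sample):
    """Moment-based theta* = (-mean/sd, 1/sd), valid for a normal F_0."""
    m, s = fmean(sample), pstdev(sample)
    return (-m / s, 1.0 / s)


def random_boundaries(theta_star, deltas, F0_inv=STD_NORMAL.inv_cdf):
    """g_i(theta*) = (c_i - theta_1) / theta_2 with c_i = F_0^{-1}(delta_i)."""
    theta1, theta2 = theta_star
    return [(F0_inv(d) - theta1) / theta2 for d in deltas]
```

By construction $F_0(\theta^*_1 + \theta^*_2\, g_i(\theta^*_N)) = \delta^0_i$, so the estimated cell probabilities are the prespecified $\Delta\delta^0_i$ whatever the sample.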

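For $\theta$ and $\bar\theta$ in $\Theta$, define the (k-1)-dimensional random vector
$u(\bar\theta,\theta)$ by

(3.18)  $u_i(\bar\theta,\theta) = F_N(g_i(\bar\theta)) - F_0(g_i(\bar\theta);\theta)$,  $i = 1,\ldots,k-1$,

and define the (k-1) x (k-1) symmetric matrix $D(\bar\theta,\theta)$ by

(3.19)  $D(\bar\theta,\theta) = \left(F_0(g_{\min(i,j)}(\bar\theta);\theta)\,[1 - F_0(g_{\max(i,j)}(\bar\theta);\theta)]\right)_{i,j=1}^{k-1}$.

Note that if $\theta = \bar\theta$ then

(3.20)  $u_i(\theta,\theta) = F_N(g_i(\theta)) - \delta^0_i$,
        $D(\theta,\theta) = D = \left(\delta^0_{\min(i,j)}\,[1 - \delta^0_{\max(i,j)}]\right)_{i,j=1}^{k-1}$.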

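Next, for $\theta$ and $\bar\theta$ in $\Theta$, let $R^2_N(\bar\theta,\theta)$ denote the
quadratic form

(3.21)  $R^2_N(\bar\theta,\theta) = N\,u(\bar\theta,\theta)' D(\bar\theta,\theta)^{-1} u(\bar\theta,\theta)$.

If $\theta^*_N$ denotes the preliminary estimate of $\theta$ used to compute the random
interval boundary points, then $\tilde\theta_N$ is defined to be any estimate for which
there exists a sequence of positive constants $\rho_N$, where $\rho_N \to 0$ as
$N \to \infty$, such that

(3.22)  $R^2_N(\theta^*_N,\tilde\theta_N) < \inf_{\theta \in \Theta} R^2_N(\theta^*_N,\theta) + \rho_N$.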


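For $\theta$ and $\bar\theta$ in $\Theta$ define the (k-1) x (k-1) symmetric matrices
$\Pi(\bar\theta,\theta)$ and $\Gamma(\bar\theta,\theta)$ by

(3.23)  $\Pi(\bar\theta,\theta) = \mathrm{diag}[f_0(\theta_1+\theta_2 g_1(\bar\theta)),\ldots,f_0(\theta_1+\theta_2 g_{k-1}(\bar\theta))]$,
        $\Gamma(\bar\theta,\theta) = \Pi(\bar\theta,\theta)^{-1} D(\bar\theta,\theta)\,\Pi(\bar\theta,\theta)^{-1}$,

and notice that when $\theta = \bar\theta$

(3.24)  $\Pi(\theta,\theta) = \Pi = \mathrm{diag}[f_0(c_1),\ldots,f_0(c_{k-1})]$,
        $\Gamma(\theta,\theta) = \Gamma = \Pi^{-1} D\,\Pi^{-1}$.

Then, if $\theta^*_N - \theta^0 = o_p(1)$, if assumptions 1 and 2 hold, and if
$\tilde\theta_N$ satisfies (3.22) it can be shown (Chibisov (op. cit.) Theorem 5.2) that

(3.25)  $\tilde\theta_N - \theta^0 = [(1,g(\theta^0))'\Gamma^{-1}(1,g(\theta^0))]^{-1}(1,g(\theta^0))'\Gamma^{-1}\Pi^{-1}u(\theta^0,\theta^0) + o_p(1/\sqrt{N})$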


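where

(3.26)  $g(\theta) = \frac{1}{\theta_2}[c - \theta_1 1]$

and $c = (c_1,\ldots,c_{k-1})'$. Thus, it follows from (3.25) that $\tilde\theta_N$
satisfies condition A with h(x) given by

(3.27)  $h(x) = [(1,g(\theta^0))'\Gamma^{-1}(1,g(\theta^0))]^{-1}(1,g(\theta^0))'\Gamma^{-1}\Pi^{-1}\,(I_{(-\infty,g(\theta^0)]}(x) - \delta^0)$,

where $\delta^0 = (\delta^0_1,\ldots,\delta^0_{k-1})'$.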


If $\theta^*_N$ and $\theta^{**}_N$ denote two (possibly distinct) estimates of $\theta$,
then the Pearson chi-square test statistic is given by $X^2_N(\theta^{**}_N)$ in the fixed
interval case and by $R^2_N(\theta^*_N,\theta^{**}_N)$ in the random interval case. In
order to determine the limit distributions of these statistics under $H_0$, it is
necessary to investigate the limit distributions of $u(\theta^{**}_N)$ and
$u(\theta^*_N,\theta^{**}_N)$. If $\theta^*_N - \theta^0 = o_p(1)$ and $\theta^{**}_N$
satisfies condition A then
III follows from assumptions 1 and 2, note that by Taylor's theorem with Cauchy remainder
there exists an $\alpha$, with $0 < \alpha < 1$, such that if
$\bar\theta = \theta^0 + \alpha(\theta-\theta^0)$ then

$F_0(c_i;\theta) = F_0(c_i;\theta^0) + (\theta-\theta^0)'(1,c_i)' f_0((1,c_i)\theta^0) + r_i(\theta)$

where $r_i(\theta) = (\theta-\theta^0)'(1,c_i)'[f_0((1,c_i)\bar\theta) - f_0((1,c_i)\theta^0)]$.
Thus, it follows from the Cauchy-Schwarz inequality that
$|r_i(\theta)|/\|\theta-\theta^0\| \le \sqrt{1+c_i^2}\,|f_0((1,c_i)\bar\theta) - f_0((1,c_i)\theta^0)|$.
Therefore, since $\bar\theta \to \theta^0$ as $\|\theta-\theta^0\| \to 0$ and since $f_0$
is continuous, III is satisfied.

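In the random interval case specify

(A.6)  $0 = \delta^0_0 < \delta^0_1 < \cdots < \delta^0_{k-1} < \delta^0_k = 1$,

and for $\epsilon > 0$ let $B(\epsilon) = \bigcup_{i=1}^{k-1} (\delta^0_i-\epsilon,\,\delta^0_i+\epsilon)$
and define $S(\epsilon)$ as in (A.5). Then Chibisov assumes:

I)   $\theta^0$ is in the interior of $\Theta$.

II)  $F_0(x;\theta^0)$ is continuous in x for $x \in S(\epsilon)$.

III) There exists $\epsilon > 0$ such that for x in $S(\epsilon)$, $F_0(x;\theta)$ is
     differentiable in $\theta$ at $\theta^0$ and
     $F_0(x;\theta) = F_0(x;\theta^0) + (\theta-\theta^0)'h(x) + r(x,\theta)$
     where $h(x) = (h_1(x),\ldots,h_s(x))'$,
     $h_j(x) = \frac{\partial}{\partial\theta_j}F_0(x;\theta)\big|_{\theta=\theta^0}$,
     $v_j(u) = h_j(F_0^{-1}(u;\theta^0))$ is bounded and continuous in u for u in
     $B(\epsilon)$, and $\sup_{x \in S(\epsilon)} |r(x,\theta)| = o(\|\theta-\theta^0\|)$
     as $\|\theta-\theta^0\| \to 0$.

IV)  The rank of $B = \left(\Delta h_j(g_i(\theta^0))/\sqrt{\Delta\delta^0_i}\right)_{i=1,\,j=1}^{k,\;s}$ is s.

V)   For any $\tau > 0$ there exist $\rho > 0$ and $\epsilon > 0$ such that if
     $x_i \in S(\delta^0_i,\epsilon)$ for $i = 1,\ldots,k-1$ and for $x_0 = -\infty$,
     $x_k = +\infty$, then $\|\theta-\theta^0\| > \tau$ implies
     $\max_{1 \le i \le k} |\Delta F_0(x_i;\theta) - \Delta\delta^0_i| > \rho$.

Clearly, assumptions I, II, IV, and V follow from assumptions 1 and 2 as in the fixed
interval case. To show III the following notation will be used. Let
$\beta = \min(\delta^0_1,\,1-\delta^0_{k-1}) > 0$ and for $0 < \epsilon < \beta$ let
$D(\epsilon) = \{x : (F_0^{-1}(\delta^0_1-\epsilon)-\theta^0_1)/\theta^0_2 \le x \le (F_0^{-1}(\delta^0_{k-1}+\epsilon)-\theta^0_1)/\theta^0_2\}$
and define $M(\epsilon) = \max_{x \in D(\epsilon)} \sqrt{1+x^2}$, which is finite since
$D(\epsilon)$ is compact. Also, for $\tau > 0$ and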
APPENDIX


The purpose of this appendix is to show that the assumptions given in section 2 of
Chibisov (op. cit.) (which are required to show (3.9) and (3.25)) are implied by
assumptions 1 and 2. In order to accomplish this first note that for all $\theta$ in
$\Theta$, assumptions 1 and 2 imply that

(A.1)  $\frac{\partial}{\partial\theta} F_0(x;\theta) = (1,x)' f_0((1,x)\theta)$

for all finite x and

(A.2)  $F_0^{-1}(u;\theta) = (F_0^{-1}(u) - \theta_1)/\theta_2$

for all $0 < u < 1$.

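In the fixed interval case specify

(A.3)  $-\infty < c_1 < c_2 < \cdots < c_{k-1} < +\infty$

and define

(A.4)  $\delta^0_i = F_0(c_i;\theta^0)$,  $i = 1,\ldots,k-1$.

In addition, for $\epsilon > 0$ let $S(\epsilon) = \bigcup_i S(\delta^0_i,\epsilon)$ where

(A.5)  $S(\delta^0_i,\epsilon) = \{x : |F_0(x;\theta^0) - \delta^0_i| < \epsilon\}$,  $i = 1,\ldots,k-1$.

Then Chibisov assumes:

I)   $\theta^0$ is in the interior of $\Theta$.

II)  $F_0(x;\theta^0)$ is continuous in x.

III) For each $i = 1,2,\ldots,k-1$,
     $F_0(c_i;\theta) = F_0(c_i;\theta^0) + (\theta-\theta^0)'h(c_i) + r_i(\theta)$
     where $h(c_i) = (h_1(c_i),\ldots,h_s(c_i))'$ and
     $|r_i(\theta)| = o(\|\theta-\theta^0\|)$ as $\|\theta-\theta^0\| \to 0$.

IV)  The rank of B is s where
     $B = \left(\Delta h_j(c_i)/\sqrt{\Delta\delta^0_i}\right)_{i=1,\,j=1}^{k,\;s}$.

V)   For any $\tau > 0$ there exists $\rho > 0$ such that $\|\theta-\theta^0\| > \tau$
     implies $\max_{1 \le i \le k} |\Delta F_0(c_i;\theta) - \Delta F_0(c_i;\theta^0)| > \rho$.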

Clearly, assumptions 1 and 2 imply I and II since $\Theta$ is an open set and for all
$\theta$ in $\Theta$, $F_0(x;\theta)$ is continuous in x for all real x. In addition,
(2.2) shows that V holds and, since $k > 4$, the rank of B is 2 (see (3.7)) which shows
IV. To see that


(7.5)  $\Pi(\theta) = \mathrm{diag}[q_0(\theta_1+\theta_2 t(c_1)),\ldots,q_0(\theta_1+\theta_2 t(c_{k-1}))]$

and, in (5.7), replace $f_0$ with $q_0$, $F_0$ with $Q_0$, and $c_i$ with $t(c_i)$. Then,
if $\theta^*_N - \theta^0 = o_p(1)$, $\hat\theta_N$ is given by (5.9) and the results of
section 6 hold.
$p(x) = (\gamma_1/\gamma_2)(x/\gamma_2)^{\gamma_1-1}\exp\{-(x/\gamma_2)^{\gamma_1}\}$ for $x > 0$,
$p(x) = 0$ for $x \le 0$,

where $\gamma_1$ and $\gamma_2$ are positive, satisfies assumptions 3 and 4 with
$A = (0,\infty)$, $Q_0(y) = 1 - e^{-e^{y}}$, $\theta_1 = -\gamma_1\log\gamma_2$,
$\theta_2 = \gamma_1$, and $t(x) = \log(x)$.
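The Weibull example can be verified numerically. The sketch below treats $\gamma_1$ as the shape and $\gamma_2$ as the scale, which is what the stated values of $\theta_1$ and $\theta_2$ imply; the function names are hypothetical.

```python
import math


def weibull_cdf(x, shape, scale):
    """Distribution function of the Weibull density with the given shape and scale."""
    if x <= 0:
        return 0.0
    return 1.0 - math.exp(-((x / scale) ** shape))


def Q0(y):
    """Q_0(y) = 1 - exp(-exp(y)), the standardized form used in assumption 3."""
    return 1.0 - math.exp(-math.exp(y))


def weibull_theta(shape, scale):
    """(theta_1, theta_2) = (-shape * log(scale), shape)."""
    return (-shape * math.log(scale), shape)
```

For x > 0, `weibull_cdf(x, shape, scale)` agrees with `Q0(theta1 + theta2 * math.log(x))`, which is the representation $F_0(x;\theta) = Q_0(\theta_1 + \theta_2 t(x))$ with $t(x) = \log(x)$.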

The analysis under assumptions 3 and 4 is precisely the same as under assumptions 1
and 2 with the following modifications. In the random interval case i) redefine
$c_i = F_0^{-1}(\delta^0_i)$ and $g_i(\theta) = (c_i - \theta_1)/\theta_2$ as

(7.1)  $c_i = Q_0^{-1}(\delta^0_i)$,
       $v_i(\theta) = (c_i - \theta_1)/\theta_2$,  $i = 1,\ldots,k-1$,
       $g_i(\theta) = t^{-1}(v_i(\theta))$,

ii) change the definition of $\Pi(\bar\theta,\theta)$ in (3.23) to

(7.2)  $\Pi(\bar\theta,\theta) = \mathrm{diag}[q_0(\theta_1+\theta_2 v_1(\bar\theta)),\ldots,q_0(\theta_1+\theta_2 v_{k-1}(\bar\theta))]$,

iii) change (4.4) and (4.5) to

(7.3)  $y_i = c_i + (1/q_0(c_i))[F_N(g_i(\theta^*_N)) - \delta^0_i]$,  $i = 1,\ldots,k-1$,
       $\epsilon(\theta) = y - [1, v(\theta^*_N)]\theta$,

where $v(\theta) = (v_1(\theta),\ldots,v_{k-1}(\theta))'$, and iv) in (4.10) and (4.11)
replace $f_0$ by $q_0$. If $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$ then $\hat\theta_N$
is given by (4.13) and the results of section 6 hold. In the fixed interval case replace
(5.3) and (5.4) with

(7.4)  $y_i = (1,t(c_i))\theta^*_N + (q_0((1,t(c_i))\theta^*_N))^{-1}[F_N(c_i) - F_0(c_i;\theta^*_N)]$,  $i = 1,\ldots,k-1$,
       $\epsilon(\theta) = y - [1,t(c)]\theta$,

where $t(c) = (t(c_1),\ldots,t(c_{k-1}))'$ and $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$,
change the definition of $\Pi(\theta)$ given in (3.11) to
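lemma 1, ii) the elements of $g(\theta)$ are continuous in $\theta$, and iii)

(6.10)  $\hat R^2_N(\theta^*_N,\theta) - \hat R^2_N(\theta^*_N,\bar\theta) = N[(\bar\theta-\theta)'(1,g(\theta^*_N))'\Gamma^{-1}(2\epsilon(\bar\theta) + (1,g(\theta^*_N))(\bar\theta-\theta))]$,

(6.9) shows that

(6.11)  $\hat R^2_N(\theta^*_N,\hat\theta_N) - \hat R^2_N(\theta^*_N,\tilde\theta_N) = o_p(1)$.

Therefore, it follows from (6.3) and (3.27) that if
$\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$ and if $\hat\theta_N$ is given by (4.13) then

(6.12)  $R^2_N(\theta^*_N,\hat\theta_N) - R^2_N(\theta^*_N,\tilde\theta_N) = o_p(1)$

for any estimate $\tilde\theta_N$ satisfying (3.22).

Similarly, if $\theta^*_N - \theta^0 = o_p(1)$, if $\hat\theta_N$ is given by (5.6), and
if $\tilde\theta_N$ is any estimate satisfying (3.5), then, since i) the elements of
$\Gamma(\theta)^{-1}$ are continuous in $\theta$, ii)
$\epsilon(\tilde\theta_N) = O_p(1/\sqrt{N})$ by lemma 2 and (3.13), and iii)

(6.13)  $\hat X^2_N(\theta) - \hat X^2_N(\bar\theta) = N[(\bar\theta-\theta)'(1,c)'\Gamma(\theta^*_N)^{-1}(2\epsilon(\bar\theta) + (1,c)(\bar\theta-\theta))]$,

it follows from lemma 5, (3.13) and (6.5) that

(6.14)  $\hat\theta_N - \tilde\theta_N = o_p(1/\sqrt{N})$,

(6.15)  $\hat X^2_N(\hat\theta_N) - \hat X^2_N(\tilde\theta_N) = o_p(1)$,

(6.16)  $X^2_N(\hat\theta_N) - X^2_N(\tilde\theta_N) = o_p(1)$.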

7.  An  Extension 

In  this  section  assumptions  1  and  2  are  modified  as  follows. 
Assumption 3: There exists a parameterization of $F_0$ such that
i)   $\Theta = \{\theta : \theta_1 \in R \text{ and } \theta_2 > 0\}$
ii)  The support of $F_0(\cdot\,;\theta)$ is the connected set A for all $\theta$ in $\Theta$
iii) $F_0(x;\theta) = Q_0(\theta_1+\theta_2 t(x))$ where t is a homeomorphism from A onto R.
Assumption 4: $Q_0(y)$ is a one to one continuously differentiable function of y and
$q_0(y) = Q_0'(y)$ is non-zero for all finite y.

As  an  example,  the  Weibull  distribution,  which  has  the  density 
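(6.7)  $X^2_N(\hat\theta_N) - \hat X^2_N(\hat\theta_N) = o_p(1)$

and

(6.8)  $X^2_N(\hat\theta_N) \xrightarrow{d} \chi^2_{k-3}$.

Lemma 5: i) If assumptions 1 and 2 hold, if $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$,
and if $\hat\theta_N$ is given by (4.13) then $\hat\theta_N$ satisfies condition A with
h(x) given by (3.27).

ii) If assumptions 1 and 2 hold, if $\theta^*_N - \theta^0 = o_p(1)$, and if
$\hat\theta_N$ is given by (5.9) then $\hat\theta_N$ satisfies condition A with h(x)
given by (3.13).

Proof: i) (4.5) and (4.8) imply that

$\hat\theta_N - \theta^0 = [(1,g(\theta^*_N))'\Gamma^{-1}(1,g(\theta^*_N))]^{-1}(1,g(\theta^*_N))'\Gamma^{-1}\epsilon(\theta^0)$.

Thus, since the elements of $g(\theta)$ are continuous in $\theta$ and since
$\epsilon(\theta^0) = O_p(1/\sqrt{N})$ if $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$,

$\hat\theta_N - \theta^0 = [(1,g(\theta^0))'\Gamma^{-1}(1,g(\theta^0))]^{-1}(1,g(\theta^0))'\Gamma^{-1}\epsilon(\theta^0) + o_p(1/\sqrt{N})$.

Furthermore, $\epsilon(\theta^0) = \Pi^{-1}u(\theta^*_N,\theta^0) + o_p(1/\sqrt{N})$ by
lemma 1 and (see Chibisov (op. cit.) corollary 3.3)
$u(\theta^*_N,\theta^0) - u(\theta^0,\theta^0) = o_p(1/\sqrt{N})$. Thus,

$\hat\theta_N - \theta^0 = [(1,g(\theta^0))'\Gamma^{-1}(1,g(\theta^0))]^{-1}(1,g(\theta^0))'\Gamma^{-1}\Pi^{-1}u(\theta^0,\theta^0) + o_p(1/\sqrt{N})$

which is the same as (3.25).

ii) (5.4) and (5.6) imply that

$\hat\theta_N - \theta^0 = [(1,c)'\Gamma(\theta^*_N)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta^*_N)^{-1}\epsilon(\theta^0)$
$= [(1,c)'\Gamma(\theta^0)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta^0)^{-1}\epsilon(\theta^0) + o_p(1/\sqrt{N})$

since i) $\theta^*_N - \theta^0 = o_p(1)$, ii) the elements of $\Gamma(\theta)^{-1}$ are
continuous in $\theta$, and iii) $\epsilon(\theta^0) = O_p(1/\sqrt{N})$ by lemma 2.
Furthermore, (i) of lemma 2 shows that
$\epsilon(\theta^0) = \Pi(\theta^0)^{-1}u(\theta^0) + o_p(1/\sqrt{N})$. Thus,

$\hat\theta_N - \theta^0 = [(1,c)'\Gamma(\theta^0)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta^0)^{-1}\Pi(\theta^0)^{-1}u(\theta^0) + o_p(1/\sqrt{N})$

which is the same as (3.12).  ■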
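The estimator handled in part i) of the proof is a generalized least-squares fit of a working vector y on $(1, g(\theta^*_N))$ with weight matrix $\Gamma^{-1}$. The sketch below is a reconstruction under stated assumptions, not the authors' own formula: equations (4.4) and (4.13) are not reproduced in this part of the text, so y is taken from (7.3) with $q_0$ replaced by $f_0$, $F_0$ is taken to be standard normal, and the weighted inner product uses the cell-wise identity behind (3.6), $a'D^{-1}b = \sum_i \Delta a_i\,\Delta b_i/\Delta\delta^0_i$. All function names are hypothetical.

```python
import bisect
import random
from statistics import NormalDist, fmean, pstdev

STD_NORMAL = NormalDist()  # plays the role of F_0 (an assumption of this sketch)


def delta(values):
    """Differences a_i - a_{i-1}, i = 1..k, with a_0 = a_k = 0 (section 2 convention)."""
    padded = [0.0] + list(values) + [0.0]
    return [padded[i] - padded[i - 1] for i in range(1, len(padded))]


def least_squares_estimate(sample, deltas, theta_star):
    """Assumed form: [(1,g)' Gamma^{-1} (1,g)]^{-1} (1,g)' Gamma^{-1} y."""
    n = len(sample)
    ordered = sorted(sample)
    theta1, theta2 = theta_star
    c = [STD_NORMAL.inv_cdf(d) for d in deltas]        # c_i = F_0^{-1}(delta_i)
    dens = [STD_NORMAL.pdf(ci) for ci in c]            # diagonal of Pi
    g = [(ci - theta1) / theta2 for ci in c]           # random boundaries g_i(theta*)
    F_N = [bisect.bisect_right(ordered, gi) / n for gi in g]
    y = [ci + (Fi - d) / fi for ci, Fi, d, fi in zip(c, F_N, deltas, dens)]
    cell_probs = [hi - lo for lo, hi in zip([0.0] + list(deltas), list(deltas) + [1.0])]

    def form(a, b):
        # a' Gamma^{-1} b = (Pi a)' D^{-1} (Pi b), evaluated cell by cell.
        da = delta([f * x for f, x in zip(dens, a)])
        db = delta([f * x for f, x in zip(dens, b)])
        return sum(x * z / p for x, z, p in zip(da, db, cell_probs))

    ones = [1.0] * len(c)
    w1, w2, w3 = form(ones, ones), form(ones, g), form(g, g)
    b1, b2 = form(ones, y), form(g, y)
    det = w1 * w3 - w2 * w2
    # Solve the 2 x 2 normal equations explicitly.
    return ((w3 * b1 - w2 * b2) / det, (w1 * b2 - w2 * b1) / det)
```

A possible use, with a moment-based preliminary estimate for normal data:

```python
rng = random.Random(1)
sample = [rng.gauss(2.0, 3.0) for _ in range(4000)]
m, s = fmean(sample), pstdev(sample)
theta_hat = least_squares_estimate(sample, [0.2, 0.4, 0.6, 0.8], (-m / s, 1.0 / s))
```

Under this parameterization the true value is $\theta^0 = (-\mu/\sigma, 1/\sigma)$, so `theta_hat` should lie close to (-2/3, 1/3) for the simulated sample.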

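An immediate consequence of lemma 5 is that if $\theta^*_N - \theta^0 = O_p(1/\sqrt{N})$
and if $\hat\theta_N$ is given by (4.13), then

(6.9)  $\hat\theta_N - \tilde\theta_N = o_p(1/\sqrt{N})$

for any estimate $\tilde\theta_N$ satisfying (3.22). Thus, since i)
$\epsilon(\tilde\theta_N) = O_p(1/\sqrt{N})$ by (3.27) and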
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_  i  *  o 

Since  the  elements  of  r ( 9 )  are  continuous  in  9#  P(&M)  “  P(®  )  ■  o  (1)«  Thus,  since 

n  p 

e<9°)  -  y  -  ( 1  ,c)0°  and  since  P(0*)(1,c)  -  0,  e(SN>  -  P(9*)y  -  P(0*)e(0°>.  Thus, 

X^(8n >  -N  c(0°),P(fl*)T(0*)"1P(0*)e(0o)  so  that 

-  Ne(0°),P(0°),r(0o)-1P(0O  )e(6°)  -  op(1)  by  lemma  2.  Therefore,  since 
/n  e(0°)  2  ^0,7(6°))  and  since  r(0°)P(0°)  T(0°)  1 1> ( 0° )  is  idempotent  with  rank 

k-3,  the  result  follows.  ■ 

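In order to derive the limit distribution of $R_N^2(\theta_N^*,\hat\theta_N)$, notice that (i) of lemma 1, (3.21), and (4.7) show that

(6.1)  $\hat R_N^2(\theta_N^*,\theta) = R_N^2(\theta_N^*,\theta) - C(\theta_N^*,\theta)$

where

(6.2)  $C(\theta_N^*,\theta) = N e(\theta)'[\Gamma(\theta_N^*,\theta)^{-1} - \Gamma^{-1}]e(\theta) + N[2e(\theta) + \varepsilon(\theta_N^*,\theta)]'\Gamma(\theta_N^*,\theta)^{-1}\varepsilon(\theta_N^*,\theta).$

Furthermore, if $\theta_N^{**}$ is an estimate of $\theta^0$ which satisfies condition A and if $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, lemma 1 and the continuity of the elements of $\Gamma(\theta_N^*,\theta)^{-1}$ show that $C(\theta_N^*,\theta_N^{**}) = o_p(1)$. Thus, if it can be shown that $\hat\theta_N$, as defined in (4.13), satisfies condition A, then

(6.3)  $R_N^2(\theta_N^*,\hat\theta_N) - \hat R_N^2(\theta_N^*,\hat\theta_N) = o_p(1)$

and

(6.4)  $R_N^2(\theta_N^*,\hat\theta_N) \xrightarrow{d} \chi^2_{k-3}.$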

Similarly,

(6.5)  $\hat X_N^2(\theta) = X_N^2(\theta) - C(\theta)$

where

(6.6)  $C(\theta) = N e(\theta)'[\Gamma(\theta)^{-1} - \Gamma(\theta_N^*)^{-1}]e(\theta) + N[2e(\theta) + r(\theta)]'\Gamma(\theta)^{-1}r(\theta)$

and, if $\theta_N^* - \theta^0 = o_p(1)$ and $\theta_N^{**}$ satisfies condition A, it follows from lemma 2 and the continuity of the elements of $\Gamma(\theta)^{-1}$ that $C(\theta_N^{**}) = o_p(1)$. Thus, if $\hat\theta_N$, as defined in (5.9), satisfies condition A, then


carrying  out  the  arithmetic  in  (5.6),  it  is  seen  that 


(5.9) 


01.N  ’a(V  +  6(V 


3  =  8(6  ) 

2  ,N  N 


where $\theta_N^*$ is a preliminary estimate of $\theta^0$ satisfying $\theta_N^* - \theta^0 = o_p(1)$.

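6. The Asymptotic Properties of $\hat\theta_N$, $\hat R_N^2(\theta_N^*,\hat\theta_N)$, and $\hat X_N^2(\hat\theta_N)$

The following two lemmas provide the limit distributions of $\hat R_N^2(\theta_N^*,\hat\theta_N)$ and $\hat X_N^2(\hat\theta_N)$ under $H_0$.

Lemma 3: If assumptions 1 and 2 hold, if $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, and if $\hat\theta_N$ is given by (4.13) then under $H_0$,

$\hat R_N^2(\theta_N^*,\hat\theta_N) \xrightarrow{d} \chi^2_{k-3}.$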

Proof: Note first that the (i,j)th element of the matrix

$[1,g(\theta)][(1,g(\theta))'\Gamma^{-1}(1,g(\theta))]^{-1}(1,g(\theta))'$

is given by

$(w_3 - (\bar c_i + \bar c_j)w_2 + \bar c_i \bar c_j w_1)/(w_1 w_3 - w_2^2).$

Thus, the $(k-1) \times (k-1)$ matrix

$P = I - [1,g(\theta_N^*)][(1,g(\theta_N^*))'\Gamma^{-1}(1,g(\theta_N^*))]^{-1}(1,g(\theta_N^*))'\Gamma^{-1}$

does not depend on $\theta_N^*$. Furthermore, since $e(\hat\theta_N) = y - (1,g(\theta_N^*))\hat\theta_N$ and since $P[1,g(\theta_N^*)] = 0$, $e(\hat\theta_N) = Py = P e(\theta^0)$. Thus, it follows from lemma 1 that $\sqrt{N}\,e(\hat\theta_N) \xrightarrow{d} N_{k-1}(0,P\Gamma P')$ and, since $(P\Gamma P')\Gamma^{-1}$ is idempotent with rank $k-3$, the result follows.  ■
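
The rank argument used in this proof can be checked numerically. The following Python sketch is an illustration only: the values of `delta`, `f`, and `c_bar` are hypothetical, and $\Gamma$ is taken to be $\Pi^{-1}D\Pi^{-1}$ with $D_{ij} = \min(\delta_i,\delta_j) - \delta_i\delta_j$, which is an assumption about the form of $D$ rather than something stated in this section. It builds $P$ from the weighted projection formula and confirms that $P[1,\bar c] = 0$ and that $(P\Gamma P')\Gamma^{-1}$ is idempotent with trace $k-3$.

```python
def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def transpose(X):
    return [list(col) for col in zip(*X)]

def inverse(M):
    """Gauss-Jordan inverse with partial pivoting (small matrices only)."""
    n = len(M)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(M)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                fac = aug[r][col]
                aug[r] = [v - fac * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def max_abs_diff(X, Y):
    return max(abs(a - b) for rx, ry in zip(X, Y) for a, b in zip(rx, ry))

# Hypothetical inputs for k = 6 cells (k - 1 = 5 interior boundaries).
delta = [0.1, 0.3, 0.5, 0.7, 0.9]        # limiting cell-boundary probabilities
f = [0.18, 0.35, 0.40, 0.35, 0.18]       # density values at the boundaries
c_bar = [-1.28, -0.52, 0.0, 0.52, 1.28]  # boundary points
n = len(delta)

# Assumed form: Gamma_ij = (min(d_i, d_j) - d_i * d_j) / (f_i * f_j).
Gamma = [[(min(delta[i], delta[j]) - delta[i] * delta[j]) / (f[i] * f[j])
          for j in range(n)] for i in range(n)]
Gamma_inv = inverse(Gamma)

A = [[1.0, x] for x in c_bar]            # design matrix [1, c_bar]
At = transpose(A)
core = inverse(matmul(At, matmul(Gamma_inv, A)))
H = matmul(A, matmul(core, matmul(At, Gamma_inv)))
P = [[float(i == j) - H[i][j] for j in range(n)] for i in range(n)]

# M = (P Gamma P') Gamma^{-1}; for an idempotent matrix, trace equals rank.
M = matmul(matmul(P, Gamma), matmul(transpose(P), Gamma_inv))
PA = matmul(P, A)
trace_M = sum(M[i][i] for i in range(n))
print(round(trace_M, 8))
```

Because the trace of an idempotent matrix equals its rank, the printed trace gives the degrees of freedom of the limiting chi-square distribution for this choice of $k$.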

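Lemma 4: If assumptions 1 and 2 hold, if $\theta_N^* - \theta^0 = o_p(1)$, and if $\hat\theta_N$ is given by (5.9) then under $H_0$,

$\hat X_N^2(\hat\theta_N) \xrightarrow{d} \chi^2_{k-3}.$

Proof: Define the $(k-1) \times (k-1)$ matrix $P(\theta)$ by

$P(\theta) = I - (1,c)[(1,c)'\Gamma(\theta)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta)^{-1}.$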
Furthermore, since $\theta_N^{**} - \theta^0 = O_p(1/\sqrt{N})$, so that $r(F_o(c_i;\theta_N^{**}),F_o(c_i;\theta^0)) = o_p(1/\sqrt{N})$, $r(\theta_N^{**}) = o_p(1/\sqrt{N})$ and (i) is proved. (ii) and (iii) now follow immediately from (i) and (3.28).  ■

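If $\Gamma(\theta^0)$ were known then, in analogy with (4.7), $\hat\theta_N$ would be defined as that estimate of $\theta^0$ which minimizes the quadratic form $N e(\theta)'\Gamma(\theta^0)^{-1}e(\theta)$. Since $\Gamma(\theta^0)^{-1}$ is not known, however, it is necessary to estimate it. Thus, let $\theta_N^*$ denote the preliminary estimate of $\theta^0$ used to define y, and define

(5.5)  $\hat X_N^2(\theta) = N e(\theta)'\Gamma(\theta_N^*)^{-1}e(\theta).$

Then $\hat\theta_N$ minimizes $\hat X_N^2(\theta)$ with respect to $\theta$ and is given by

(5.6)  $\hat\theta_N = [(1,c)'\Gamma(\theta_N^*)^{-1}(1,c)]^{-1}(1,c)'\Gamma(\theta_N^*)^{-1}y.$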

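In analogy with section 4, let

(5.7)
$w_1(\theta) = 1'\Gamma(\theta)^{-1}1 = \sum_{i=1}^{k} [\Delta f_o(\theta_1+\theta_2 c_i)]^2 / \Delta F_o(\theta_1+\theta_2 c_i)$

$w_2(\theta) = 1'\Gamma(\theta)^{-1}c = \sum_{i=1}^{k} [\Delta f_o(\theta_1+\theta_2 c_i)][\Delta(c_i f_o(\theta_1+\theta_2 c_i))] / \Delta F_o(\theta_1+\theta_2 c_i)$

$w_3(\theta) = c'\Gamma(\theta)^{-1}c = \sum_{i=1}^{k} [\Delta(c_i f_o(\theta_1+\theta_2 c_i))]^2 / \Delta F_o(\theta_1+\theta_2 c_i)$

$w_4(\theta) = 1'\Gamma(\theta)^{-1}y = \sum_{i=1}^{k} [\Delta f_o(\theta_1+\theta_2 c_i)][\Delta(y_i f_o(\theta_1+\theta_2 c_i))] / \Delta F_o(\theta_1+\theta_2 c_i)$

$w_5(\theta) = c'\Gamma(\theta)^{-1}y = \sum_{i=1}^{k} [\Delta(c_i f_o(\theta_1+\theta_2 c_i))][\Delta(y_i f_o(\theta_1+\theta_2 c_i))] / \Delta F_o(\theta_1+\theta_2 c_i),$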

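and define

(5.8)
$\alpha(\theta) = [w_3(\theta)w_4(\theta) - w_2(\theta)w_5(\theta)] / [w_1(\theta)w_3(\theta) - w_2(\theta)^2]$

$\beta(\theta) = [w_1(\theta)w_5(\theta) - w_2(\theta)w_4(\theta)] / [w_1(\theta)w_3(\theta) - w_2(\theta)^2].$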
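
Assuming (5.8) takes the usual form $\alpha = (w_3 w_4 - w_2 w_5)/(w_1 w_3 - w_2^2)$ and $\beta = (w_1 w_5 - w_2 w_4)/(w_1 w_3 - w_2^2)$, these ratios are simply the solution of the 2 x 2 weighted normal equations for regressing y on (1, c). The Python sketch below checks this on made-up numbers; the matrix `W` stands in for $\Gamma(\theta)^{-1}$, and `W`, `c`, and `y` are all hypothetical.

```python
def bilinear(a, W, b):
    """Return a' W b for vectors a, b and a square matrix W."""
    return sum(a[i] * W[i][j] * b[j] for i in range(len(a)) for j in range(len(b)))

# Hypothetical stand-ins: W plays the role of Gamma(theta)^{-1}.
W = [[2.0, -0.5, 0.0],
     [-0.5, 1.5, -0.4],
     [0.0, -0.4, 1.0]]
c = [-0.7, 0.1, 0.9]
y = [-0.3, 0.35, 1.2]
ones = [1.0] * len(c)

# The five weighted inner products that play the role of w1, ..., w5.
w1 = bilinear(ones, W, ones)
w2 = bilinear(ones, W, c)
w3 = bilinear(c, W, c)
w4 = bilinear(ones, W, y)
w5 = bilinear(c, W, y)

det = w1 * w3 - w2 ** 2
alpha = (w3 * w4 - w2 * w5) / det   # weighted intercept
beta = (w1 * w5 - w2 * w4) / det    # weighted slope

def quad_form(a, b):
    """Weighted residual sum of squares e' W e with e = y - a - b * c."""
    e = [yi - a - b * ci for yi, ci in zip(y, c)]
    return bilinear(e, W, e)

# At the minimizer the residual is W-orthogonal to both regressors.
resid = [yi - alpha - beta * ci for yi, ci in zip(y, c)]
print(alpha, beta, quad_form(alpha, beta))
```

The two orthogonality conditions on `resid` are exactly the normal equations, so any perturbation of `alpha` or `beta` can only increase the quadratic form.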

Notice that $y_i$ is a consistent estimate of $\theta_1^0 + \theta_2^0 c_i$, not $c_i$, so that $\alpha(\theta)$ and $\beta(\theta)$ do not have the same interpretations as in section 4. In fact, by
where $\pi_i(\theta^0) = f_o(\theta_1^0 + \theta_2^0 c_i)$. However, since $\theta^0$ is unknown, in order to define the fixed interval analog to the random vector y it is first necessary to estimate $\theta^0$. Thus, let $\theta_N^*$ denote a preliminary estimate of $\theta^0$ satisfying $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, let $y = (y_1,\ldots,y_{k-1})'$ where

(5.3)  $y_i = (1,c_i)\theta_N^* + \pi_i(\theta_N^*)^{-1}[F_N(c_i) - F_o(c_i;\theta_N^*)], \quad i = 1,\ldots,k-1,$

and, for $\theta$ in $\Theta$, define the (k-1)-dimensional random vector $e(\theta)$ by

(5.4)  $e(\theta) = y - (1,c)\theta.$

Then the following lemma records some properties of $e(\theta)$.

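Lemma 2: If $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, if $\theta_N^{**}$ satisfies condition A, and if assumptions 1 and 2 are satisfied, then

i) $e(\theta) = \Pi(\theta^0)^{-1}u(\theta) + r(\theta)$ where $r(\theta_N^{**}) = o_p(1/\sqrt{N})$,

ii) $\sqrt{N}\,e(\theta_N^{**}) \xrightarrow{d} N_{k-1}(0,\Pi(\theta^0)^{-1}\Sigma_1(\theta^0)\Pi(\theta^0)^{-1})$ where $\Sigma_1(\theta^0)$ is defined in (3.28), and

iii) $\sqrt{N}\,e(\theta^0) \xrightarrow{d} N_{k-1}(0,\Gamma(\theta^0))$.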


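Proof: Since $F_o(c_i;\theta)$ is continuously differentiable in $\theta$ with

$\frac{\partial}{\partial\theta}F_o(c_i;\theta) = \pi_i(\theta)(1,c_i)'$,

$y_i = (1,c_i)\theta_N^* + \pi_i(\theta_N^*)^{-1}\{[F_N(c_i) - F_o(c_i;\theta^0)] - [F_o(c_i;\theta_N^*) - F_o(c_i;\theta^0)]\}$

$\quad = (1,c_i)\theta_N^* + \pi_i(\theta_N^*)^{-1}[F_N(c_i) - F_o(c_i;\theta^0) - \pi_i(\theta^0)(1,c_i)(\theta_N^* - \theta^0) + o_p(1/\sqrt{N})]$

$\quad = (1,c_i)\theta^0 + \pi_i(\theta^0)^{-1}[F_N(c_i) - F_o(c_i;\theta^0)] + o_p(1/\sqrt{N}).$

Note that the last equality follows from the fact that $\pi_i(\theta)$ is continuous in $\theta$ and $\theta_N^* - \theta^0 = o_p(1)$. Thus, it follows from (2.2) and (5.4) that

$e_i(\theta) = \pi_i(\theta^0)^{-1}[F_N(c_i) - F_o(c_i;\theta^0)] - [F_o^{-1}(F_o(c_i;\theta)) - F_o^{-1}(F_o(c_i;\theta^0))] + o_p(1/\sqrt{N})$

$\quad = \pi_i(\theta^0)^{-1}[F_N(c_i) - F_o(c_i;\theta^0)] - \pi_i(\theta^0)^{-1}[F_o(c_i;\theta) - F_o(c_i;\theta^0)] - r(F_o(c_i;\theta),F_o(c_i;\theta^0)) + o_p(1/\sqrt{N})$

$\quad = \pi_i(\theta^0)^{-1}u_i(\theta) + r_i(\theta)$ where $r_i(\theta) = -r(F_o(c_i;\theta),F_o(c_i;\theta^0)) + o_p(1/\sqrt{N})$.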
$F_o(g_i(\theta);\theta^0) - \delta_i^0 = f_o(\bar c_i)(1,g_i(\theta^0))(\theta - \theta^0) + \nu(\theta,\theta^0)$ where $\sup_{|\theta-\theta^0|<\tau} |\nu(\theta,\theta^0)| = o(\tau)$ as $\tau \to 0$. Thus,

(4.6)  $F_o(g_i(\theta_N^*);\theta^0) - \delta_i^0 = O_p(1/\sqrt{N})$

by the standard arguments due to Mann and Wald (1943). Therefore, since $F_N(g_i(\theta_N^*)) - \delta_i^0 = u_i(\theta_N^*,\theta^0) + F_o(g_i(\theta_N^*);\theta^0) - \delta_i^0 = O_p(1/\sqrt{N})$ by (3.28) and (4.6), $F_o(g_i(\theta_N^*);\theta_N^{**}) - \delta_i^0 = F_N(g_i(\theta_N^*)) - \delta_i^0 - u_i(\theta_N^*,\theta_N^{**}) = O_p(1/\sqrt{N})$ so that $r(F_o(g_i(\theta_N^*);\theta_N^{**}),\delta_i^0) = o_p(1/\sqrt{N})$. Thus, $\varepsilon_i(\theta_N^*,\theta_N^{**}) = o_p(1/\sqrt{N})$ and (i) is proved. (ii) and (iii) now follow immediately from (3.28).  ■

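Define the quadratic form $\hat R_N^2(\theta_N^*,\theta)$ by

(4.7)  $\hat R_N^2(\theta_N^*,\theta) = N e(\theta)'\Gamma^{-1}e(\theta).$

Then the least squares approximation to $\tilde\theta_N$, $\hat\theta_N$, minimizes $\hat R_N^2(\theta_N^*,\theta)$ with respect to $\theta$. Since the asymptotic covariance matrix of $\sqrt{N}\,e(\theta^0)$ is $\Gamma$, it follows from the theory of the general linear model that $\hat\theta_N$ is given by

(4.8)  $\hat\theta_N = [(1,g(\theta_N^*))'\Gamma^{-1}(1,g(\theta_N^*))]^{-1}(1,g(\theta_N^*))'\Gamma^{-1}y.$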


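In order to simplify (4.8) first note that since the (i,j)th element of $\Gamma^{-1}$ is given by

(4.9)  $\Gamma^{-1}_{ij} = \begin{cases} f_o(\bar c_i)^2[1/\Delta\delta_i^0 + 1/\Delta\delta_{i+1}^0] & \text{if } |i-j| = 0 \\ -f_o(\bar c_i)f_o(\bar c_{i+1})/\Delta\delta_{i+1}^0 & \text{if } |i-j| = 1 \\ 0 & \text{if } |i-j| > 1 \end{cases} \qquad i,j = 1,\ldots,k-1,$

straightforward calculations show that

(4.10)
$w_1 = 1'\Gamma^{-1}1 = \sum_{i=1}^{k} [\Delta f_o(\bar c_i)]^2 / \Delta\delta_i^0$

$w_2 = 1'\Gamma^{-1}\bar c = \sum_{i=1}^{k} [\Delta f_o(\bar c_i)][\Delta(\bar c_i f_o(\bar c_i))] / \Delta\delta_i^0$

$w_3 = \bar c'\Gamma^{-1}\bar c = \sum_{i=1}^{k} [\Delta(\bar c_i f_o(\bar c_i))]^2 / \Delta\delta_i^0,$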
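
The tridiagonal form in (4.9) and the first sum in (4.10) can be verified on a small example. The Python sketch below is illustrative only: the values in `f` and `delta` are hypothetical, the conventions $\delta_0 = 0$, $\delta_k = 1$ and $f_o = 0$ at the two end points are assumed, and $\Gamma$ is assumed to equal $\Pi^{-1}D\Pi^{-1}$ with $D_{ij} = \min(\delta_i,\delta_j) - \delta_i\delta_j$, the usual covariance of an empirical distribution function at the cell boundaries.

```python
def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def gamma_inverse(f, delta):
    """Tridiagonal matrix of (4.9); f[i], delta[i] refer to boundaries 1..k-1."""
    n = len(f)
    d = [0.0] + list(delta) + [1.0]                  # assumed delta_0 = 0, delta_k = 1
    dd = [d[i] - d[i - 1] for i in range(1, n + 2)]  # dd[i-1] = Delta delta_i, i = 1..k
    G = [[0.0] * n for _ in range(n)]
    for i in range(n):
        G[i][i] = f[i] ** 2 * (1.0 / dd[i] + 1.0 / dd[i + 1])
        if i + 1 < n:
            G[i][i + 1] = G[i + 1][i] = -f[i] * f[i + 1] / dd[i + 1]
    return G

def gamma(f, delta):
    """Assumed covariance: Gamma_ij = (min(d_i, d_j) - d_i * d_j) / (f_i * f_j)."""
    n = len(f)
    return [[(min(delta[i], delta[j]) - delta[i] * delta[j]) / (f[i] * f[j])
             for j in range(n)] for i in range(n)]

# Hypothetical inputs for k = 5 cells.
f = [0.28, 0.40, 0.35, 0.18]
delta = [0.2, 0.5, 0.7, 0.9]

G_inv = gamma_inverse(f, delta)
product = matmul(gamma(f, delta), G_inv)   # should be the identity matrix

# w1 = 1' Gamma^{-1} 1 computed directly and through the sum in (4.10).
fe = [0.0] + f + [0.0]                     # assumed f_o = 0 at both end points
d = [0.0] + delta + [1.0]
w1_closed = sum((fe[i] - fe[i - 1]) ** 2 / (d[i] - d[i - 1]) for i in range(1, len(d)))
w1_direct = sum(sum(row) for row in G_inv)
print(w1_direct, w1_closed)
```

The same differencing argument yields the sums for $w_2$ and $w_3$ once $f_o(\bar c_i)$ is replaced by $\bar c_i f_o(\bar c_i)$ in one or both factors.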
i  o  l  l 
The principal idea behind the least squares approximation to the solution of (4.1) is the fact that for all $\theta$ in $\Theta$,

(4.2)  $F_o^{-1}(F_o(g_i(\theta_N^*);\theta)) = (1,g_i(\theta_N^*))\theta, \quad i = 1,\ldots,k-1.$

Thus, it is tempting to regress the (k-1)-dimensional random vector, with ith component given by $F_o^{-1}(F_N(g_i(\theta_N^*)))$, onto the matrix $[1,g(\theta_N^*)]$ to obtain an estimate of $\theta^0$. However, $|F_o^{-1}(F_N(g_i(\theta_N^*)))|$ is infinite if $F_N(g_i(\theta_N^*))$ is equal to zero or one, and this can happen with positive probability. But, since $F_N(g_i(\theta_N^*)) - \delta_i^0 = O_p(1/\sqrt{N})$ if $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$ (see the proof of lemma 1 below), (2.2) shows that

(4.3)  $F_o^{-1}(F_N(g_i(\theta_N^*))) = \bar c_i + (\pi_i)^{-1}(F_N(g_i(\theta_N^*)) - \delta_i^0) + o_p(1/\sqrt{N}), \quad i = 1,\ldots,k-1,$

where $\pi_i = f_o(\bar c_i)$, and $(\pi_i)^{-1}(F_N(g_i(\theta_N^*)) - \delta_i^0)$ is finite regardless of whether $F_N(g_i(\theta_N^*))$ equals zero or one.
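
The point of the linearization can be seen in a small numerical sketch. The Python code below assumes, purely for illustration, that $F_o$ is the standard normal distribution and that $\delta_i^0 = 0.25$; the function name `linearized_quantile` is hypothetical. The exact inverse is undefined when the empirical value is zero or one, while the first-order expression stays finite.

```python
from statistics import NormalDist, StatisticsError

F0 = NormalDist()  # assumed null distribution: standard normal

def linearized_quantile(p_hat, delta):
    """First-order expansion of F0^{-1}(p_hat) about delta."""
    c_bar = F0.inv_cdf(delta)   # c_bar = F0^{-1}(delta)
    pi = F0.pdf(c_bar)          # pi = f0(c_bar)
    return c_bar + (p_hat - delta) / pi

delta = 0.25
for p_hat in (0.0, 0.20, 0.25, 0.30, 1.0):
    try:
        exact = F0.inv_cdf(p_hat)
    except StatisticsError:
        exact = None            # the inverse CDF is undefined at 0 and 1
    print(p_hat, exact, linearized_quantile(p_hat, delta))
```

Near `delta` the two columns agree closely, and at the end points only the linearized value is available, which is why the regression is built on the linearized quantities rather than on the exact inverse.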

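Define the (k-1)-dimensional random vector y by

(4.4)  $y_i = \bar c_i + (\pi_i)^{-1}(F_N(g_i(\theta_N^*)) - \delta_i^0), \quad i = 1,\ldots,k-1,$

where $\theta_N^*$ satisfies $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, and notice that $y_i - \bar c_i = O_p(1/\sqrt{N})$. Next, for $\theta$ in $\Theta$ define the (k-1)-dimensional random vector $e(\theta)$ by

(4.5)  $e(\theta) = y - [1,g(\theta_N^*)]\theta.$

Some properties of $e(\theta)$ are recorded in the following lemma.

Lemma 1: If $\theta_N^* - \theta^0 = O_p(1/\sqrt{N})$, if $\theta_N^{**}$ satisfies condition A, and if assumptions 1 and 2 are satisfied, then

i) $e(\theta) = \Pi^{-1}u(\theta_N^*,\theta) + \varepsilon(\theta_N^*,\theta)$ where $\varepsilon(\theta_N^*,\theta_N^{**}) = o_p(1/\sqrt{N})$,

ii) $\sqrt{N}\,e(\theta_N^{**}) \xrightarrow{d} N_{k-1}(0,\Pi^{-1}\Sigma\Pi^{-1})$ where $\Sigma$ is defined in (3.28), and

iii) $\sqrt{N}\,e(\theta^0) \xrightarrow{d} N_{k-1}(0,\Gamma)$.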


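Proof: Since

$\pi_i^{-1} = 1/f_o(\bar c_i) = \frac{d}{dt}F_o^{-1}(t)\big|_{t=\delta_i^0}$

it follows from (2.2) that

$e_i(\theta) = (\pi_i)^{-1}[F_N(g_i(\theta_N^*)) - \delta_i^0] - [F_o^{-1}(F_o(g_i(\theta_N^*);\theta)) - F_o^{-1}(\delta_i^0)]$

$\quad = (\pi_i)^{-1}[F_N(g_i(\theta_N^*)) - \delta_i^0] - (\pi_i)^{-1}(F_o(g_i(\theta_N^*);\theta) - \delta_i^0) - r(F_o(g_i(\theta_N^*);\theta),\delta_i^0)$

$\quad = (\pi_i)^{-1}u_i(\theta_N^*,\theta) + \varepsilon_i(\theta_N^*,\theta)$ where $\varepsilon_i(\theta_N^*,\theta) = -r(F_o(g_i(\theta_N^*);\theta),\delta_i^0)$.

Next, it follows from assumptions 1 and 2 and Taylor's theorem with Cauchy remainder that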


assumptions 1 and 2 are sufficient to show that (see, e.g., Moore and Spruill (op.cit.))

(3.28)
$\sqrt{N}\,u(\theta_N^{**}) \xrightarrow{d} N_{k-1}(0,\Sigma_1(\theta^0))$

$\sqrt{N}\,u(\theta_N^*,\theta_N^{**}) \xrightarrow{d} N_{k-1}(0,\Sigma).$

Here, the notation $\Sigma$ and $\Sigma_1(\theta^0)$ for the asymptotic covariance matrices of $\sqrt{N}\,u(\theta_N^*,\theta_N^{**})$ and $\sqrt{N}\,u(\theta_N^{**})$, respectively, has been employed to stress the fact that under the parameterization imposed by assumption 1, if $\theta_N^*$ and $\theta_N^{**}$ are invariant with respect to the group of linear transformations with positive slope, then $\Sigma$ does not depend on $\theta^0$, while $\Sigma_1(\theta^0)$ does depend on $\theta^0$ regardless of whether $\theta_N^{**}$ is invariant or not. For the present purpose, the particular form of $\Sigma$ and $\Sigma_1(\theta^0)$ is not important except when $\theta_N^{**} = \theta^0$, in which case $\Sigma = D$ and $\Sigma_1(\theta^0) = D(\theta^0)$.


It now follows from (3.28) that the limit distributions of $X_N^2(\theta_N^{**})$ and $R_N^2(\theta_N^*,\theta_N^{**})$ are the same as the distribution of

(3.29)  $\sum_{i=1}^{k-1} \lambda_i z_i^2$

where $z_1,\ldots,z_{k-1}$ are independent standard normal random variables and $\lambda_1,\ldots,\lambda_{k-1}$ are the characteristic roots of $D^{-1}\Sigma$ in the random interval case and $D(\theta^0)^{-1}\Sigma_1(\theta^0)$ in the fixed interval case. Furthermore, it is well known that if $\theta_N^{**} = \tilde\theta_N$ then k-3 of the $\lambda_i$ are one while the remaining two $\lambda_i$ are zero.
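
As a rough illustration of (3.29), the Python sketch below simulates a weighted sum of squared standard normal variables and compares its sample mean with the sum of the weights. The function name `simulate_limit_mean`, the seed, the number of draws, and the choice k = 6 are all assumptions made for the example. With k-3 weights equal to one and two equal to zero, the statistic is a chi-square variable with k-3 degrees of freedom, whose mean is k-3.

```python
import random

def simulate_limit_mean(lams, n_draws=20000, seed=0):
    """Monte Carlo mean of sum_i lams[i] * z_i**2 with z_i independent N(0, 1)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_draws):
        total += sum(lam * rng.gauss(0.0, 1.0) ** 2 for lam in lams)
    return total / n_draws

k = 6
lams = [1.0] * (k - 3) + [0.0, 0.0]   # k - 3 roots equal to one, two equal to zero
mean_est = simulate_limit_mean(lams)
print(mean_est, sum(lams))
```

Other choices of the weights give a limit that is not chi-square, which is the difficulty the least squares estimate is designed to avoid.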


4. The Least Squares Estimate $\hat\theta_N$: The Random Interval Case

In the random interval case an estimate $\tilde\theta_N$ satisfying (3.22) can be found, under suitable regularity conditions, by equating to zero the derivatives of $R_N^2(\theta_N^*,\theta)$ with respect to $\theta$ (while treating $\theta$ in $D(\theta_N^*,\theta)^{-1}$ as fixed) and solving for $\theta$. This leads to the system of equations

(4.1)  $(1,g(\theta_N^*))'\Pi(\theta_N^*,\theta)D(\theta_N^*,\theta)^{-1}u(\theta_N^*,\theta) = 0$

which is generally difficult to solve for $\theta$.

$0 < \varepsilon < \delta$ let $B(\theta^0,\tau) = \{\theta : |\theta - \theta^0| < \tau\}$ and let $A(\varepsilon,\tau) = \{y = (1,x)\theta : \theta \in B(\theta^0,\tau),\ x \in D(\varepsilon)\}$. Finally, let $\bar B(\varepsilon)$ denote the closure of the set $B(\varepsilon)$ defined in the line following (A.6).

It follows from assumptions 1 and 2 that for any $\varepsilon$ with $0 < \varepsilon < \delta$, $F_o(x;\theta)$ is continuously differentiable in $\theta$ at $\theta^0$ for any x in $D(\varepsilon)$. Thus, by Taylor's theorem with Cauchy remainder there exists an a (0 < a < 1), which depends on x and $\theta$, such that for $\bar\theta = \theta^0 + a(\theta - \theta^0)$,

$F_o(x;\theta) = F_o(x;\theta^0) + (\theta - \theta^0)'(1,x)'f_o((1,x)\theta^0) + r(x,\theta)$

where $r(x,\theta) = (\theta - \theta^0)'(1,x)'[f_o((1,x)\bar\theta) - f_o((1,x)\theta^0)]$. Furthermore, since $0 < \varepsilon < \delta$ implies that $\bar B(\varepsilon)$ is contained in the interval (0,1) and since, for such $\varepsilon$, $v(u) = (1,(F_o^{-1}(u) - \theta_1^0)/\theta_2^0)'f_o(F_o^{-1}(u))$ is continuous, and thus bounded, on $\bar B(\varepsilon)$, it follows that v(u) is continuous and bounded on $B(\varepsilon)$.

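Choose $\varepsilon_1$ and $\tau_1$ so that $0 < \varepsilon_1 < \delta$ and $\tau_1 > 0$. By the Cauchy-Schwarz inequality


for all $(x,\theta)$ in $D(\varepsilon_1) \times B(\theta^0,\tau_1)$. Furthermore, since $f_o$ is continuous on $A(\varepsilon_1,\tau_1)$ and since $A(\varepsilon_1,\tau_1)$ is compact, $f_o$ is uniformly continuous on $A(\varepsilon_1,\tau_1)$. Thus, given $\varepsilon_2 > 0$ (with $\varepsilon_2 <$ ) there exists $\tau_2 > 0$ (with $\tau_2 < \tau_1$) such that $|(1,x)\theta - (1,x)\theta^0| < \tau_2$ implies $|f_o((1,x)\theta) - f_o((1,x)\theta^0)| < \varepsilon_2/M(\varepsilon_1)$. Furthermore,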

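since $|(1,x)\bar\theta - (1,x)\theta^0| = a|(1,x)(\theta - \theta^0)| \le \sqrt{1+x^2}\,|\theta - \theta^0| \le M(\varepsilon_1)|\theta - \theta^0|$ for x in $D(\varepsilon_1)$, $|\theta - \theta^0| < \tau_2/M(\varepsilon_1)$ implies that $|f_o((1,x)\bar\theta) - f_o((1,x)\theta^0)| < \varepsilon_2/M(\varepsilon_1)$ for all x in $D(\varepsilon_1)$. Therefore, if $|\theta - \theta^0| < \tau_2/M(\varepsilon_1)$ then

$\sup_{x \in S(\varepsilon_1)} |r(x,\theta)|/|\theta - \theta^0| < M(\varepsilon_1)\varepsilon_2/M(\varepsilon_1) = \varepsilon_2$

since $S(\varepsilon_1) \subset D(\varepsilon_1)$. This shows that

$\sup_{x \in S(\varepsilon_1)} |r(x,\theta)| = o(|\theta - \theta^0|)$ as $|\theta - \theta^0| \to 0$

and, therefore, III follows from assumptions 1 and 2.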